Abstract

In order to produce specific complex structures from a large set of similar biochemical building blocks, 
many biochemical systems require high sensitivity to small molecular differences. The first and most com- 
mon model used to explain this high specificity is kinetic proofreading, which has been extended to a variety 
of systems from detection of DNA mismatch to cell signaling processes. While the specificity properties 
of the kinetic proofreading model are well known and were studied in various contexts, very little is known 
about its temporal behavior. In this work, we study the dynamical properties of discrete stochastic 
two-branch kinetic proofreading schemes. Using the Laplace transform of the corresponding chemical master 
equation, we obtain an analytical solution for the completion time distribution. In particular, we provide 
expressions for the specificity and the mean and the variance of the process completion times. We also 
show that, for a wide range of parameters, a process distinguishing between two different products can be 
reduced to a much simpler three-point process. Our results allow for the systematic study of the interplay 
between specificity and completion times as well as testing the validity of the kinetic proofreading model 
in biological systems. 

PACS numbers: 05.10.Gg,05.20.Dd,82.39.Rt 
I. INTRODUCTION



The strong bias toward the correct assembly of particular molecular constructs, or specificity, 
plays a key role in myriad biochemical processes such as DNA assembly, cell signaling, protein 
folding, and others. A common model accounting for the almost error free completion of these pro- 
cesses is kinetic proofreading, which was first suggested to explain the high specificity of protein 
synthesis [1]. Similar motifs are common in various biological processes where multiple error-
prone steps generate error-free results. For example, kinetic proofreading schemes are common 
in modeling of DNA synthesis, repair and replication [2, 3, 4]. Similar proofreading ideas ap-
pear in other contexts such as protein translation [1, 5], molecular transport [6], receptor-initiated 
signaling [7, 8, 9, 10, 11, 12], RNA transcription [13], and other processes. 

Various aspects of the kinetic proofreading concept have already been studied. Hopfield [1] 
and Ninio [14] demonstrated the possible increases in specificity due to single-step proofreading. 
Later explorations of similar proofreading models considered the multi-step proofreading process 
as a "black box", and studied the accuracy achieved by such processes [15] as well as the energy 
cost and optimal distribution of the proofreading effort along the proofreading chain [16]. In 
[7], kinetic proofreading was proposed as a model for the T-cell receptor, explaining the high 
discrimination between foreign antigen and self antigen with only moderately lower affinity. In 
this context the specificity of a multi-step process was studied again, as well as the time delay 
between initial binding and output signal. 

In addition to process specificity, the time required to reach this specificity also plays an impor- 
tant role in biochemical processes. A proofreading strategy must be efficient as well as specific. 
In different contexts [17, 18, 19, 20, 21] it was shown that such completion or first passage times 
provide a wealth of information about the underlying systems. Extending these results to kinetic 
proofreading suggests that the characterization of the completion time distribution may help re- 
searchers to distinguish between different kinetic models and even support or oppose the existence 
of kinetic proofreading in specific systems. Surprisingly, the completion time distributions of ki-
netic proofreading schemes have not been calculated before. 

Figure 1: Schematic description of the general two-branch kinetic proofreading scheme for error correction. 
The process begins at the point denoted with a star. From there it can hop right or left, one jump at a time, 
with rate $k_1 + k_2$. On the right half of the chain, the process can continue one step forward with rate $k_1$; it 
can also move one step backward with rate $r_1$ or return to the initial point with rate $\gamma_1$. On the left half of 
the chain, these rates are replaced with $k_2$, $r_2$ and $\gamma_2$. The leftmost and rightmost sites are absorbing sites: 
once the particle reaches one of these points, the process is completed. If the particle finishes at the rightmost site, 
the process is said to have completed correctly; if it finishes at the leftmost site, the process has completed 
incorrectly. 

In this article, we investigate the temporal behavior of different kinetic proofreading (KPR) 
schemes. We derive the chemical master equation (CME) [22] and its transform into the Laplace 
domain, which provides analytical expressions for the directional and non-directional completion 
time distributions. In particular, the zeroth, first and second derivatives of the CME's Laplace transform 
provide expressions for the specificity, mean and coefficient of variation of the completion 
times. In turn these expressions provide a starting point to examine the tradeoffs between the 
stationary and temporal behaviors of different KPR schemes. Furthermore, we show that over a 
wide range of kinetic parameters the complex proofreading process reduces to a three-state pro- 
cess with simple distributions of the transition time between the three states. We also provide a 
diagram mapping the parameter space into classes of different behavior of the completion time 
distribution. This paper is organized as follows. In Section II, we introduce the model and provide 
its chemical master equation as well as the analytical solution of the CME in the Laplace domain. 
In Section III, we show the different behaviors of the completion time distributions and divide the 
parameter space into regimes corresponding to different typical distributions. We also show the 
coefficient of variation versus the parameters of the problem and discuss its meaning. In Section 
IV, we summarize our results and their relevance to many of the problems previously studied in 
the context of kinetic proofreading. 
II. THE MODEL



Here we consider the general model of kinetic proofreading (KPR), which can be represented 
by the Markov chain in Fig. 1. The initiation state is represented by the star in the center of the 
chain, and is denoted by $(i, j) = (0, 0)$. Depending upon the system, the state $(i, j) = (0, 0)$ 
may have different meanings; in protein assembly this state may correspond to an empty A-site 
of the mRNA-ribosome complex [1], or in cell signaling the initiation state may correspond to a 
receptor with no bound ligand [7]. The state just to the right of the star, labeled by $(i, j) = (1, 0)$, 
corresponds to a single step in the "correct" direction, i.e., the intended tRNA binds to the A-site 
or the proper ligand binds to the receptor. Conversely, a step to the left is in the wrong direction 
(wrong tRNA or wrong ligand). In general there may be many wrong directions or additional 
sub-chains branching from the central initiation point, but for simplicity we consider only the case 
where there is only one right and one wrong decision. The Markov system can transition one step 
away from the initiation point with rate $k_1$ in the correct direction or $k_2$ in the incorrect direction. 
The process may also move one step toward the initiation point with rate $r_1$ or $r_2$, or back to the 
origin with rate $\gamma_1$ or $\gamma_2$. The two branches of the chain have $L_1$ and $L_2$ nodes, respectively, the 
last of which, $(L_1, 0)$ or $(0, L_2)$, is an absorbing point (representing the formation of the relevant 
final product). The chemical master equation (CME) describing the dynamics of the occupation 
probabilities is: 



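\[
\frac{dp_{i,j}(t)}{dt} =
\begin{cases}
k_2\, p_{0,L_2-1}(t) & \text{for } (i,j) = (0, L_2) \\
-(k_2+\gamma_2+r_2)\, p_{0,L_2-1}(t) + k_2\, p_{0,L_2-2}(t) & \text{for } (i,j) = (0, L_2-1) \\
-(k_2+\gamma_2+r_2)\, p_{0,j}(t) + k_2\, p_{0,j-1}(t) + r_2\, p_{0,j+1}(t) & \text{for } i = 0 \text{ and } 0 < j < L_2-1 \\
-(k_1+k_2)\, p_{0,0}(t) + r_1\, p_{1,0}(t) + r_2\, p_{0,1}(t) + \gamma_1 \sum_{i=1}^{L_1-1} p_{i,0}(t) + \gamma_2 \sum_{j=1}^{L_2-1} p_{0,j}(t) & \text{for } (i,j) = (0, 0) \\
-(k_1+\gamma_1+r_1)\, p_{i,0}(t) + k_1\, p_{i-1,0}(t) + r_1\, p_{i+1,0}(t) & \text{for } j = 0 \text{ and } 0 < i < L_1-1 \\
-(k_1+\gamma_1+r_1)\, p_{L_1-1,0}(t) + k_1\, p_{L_1-2,0}(t) & \text{for } (i,j) = (L_1-1, 0) \\
k_1\, p_{L_1-1,0}(t) & \text{for } (i,j) = (L_1, 0).
\end{cases}
\tag{1}
\]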
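For a numerical cross-check of Eq. (1), the transient part of the chain can be assembled as a generator matrix, and the completion probabilities and the mean completion time then follow from a single linear solve. The Python sketch below is illustrative only: the helper names `kpr_generator` and `completion_stats`, the state ordering, and the use of NumPy are assumptions of this example and are not part of the analysis in the text.

```python
import numpy as np

def kpr_generator(L1, L2, k1, k2, r1=0.0, r2=0.0, g1=0.0, g2=0.0):
    """Transient generator of the two-branch chain (hypothetical helper).

    Q[a, b] is the rate from transient state a to transient state b.
    State 0 is the origin (0,0); states 1..L1-1 are the sites (i,0);
    states L1..L1+L2-2 are the sites (0,j). absorb[0] and absorb[1]
    hold the rates into the correct and the wrong terminus.
    """
    n = L1 + L2 - 1
    Q = np.zeros((n, n))
    absorb = np.zeros((2, n))
    branches = [(L1, k1, r1, g1, 0), (L2, k2, r2, g2, L1 - 1)]
    for b, (L, k, r, g, offset) in enumerate(branches):
        def site(m, offset=offset):
            return 0 if m == 0 else offset + m
        for m in range(L):                      # m = 0 is the origin
            if m == L - 1:
                absorb[b, site(m)] += k         # last forward step is absorbing
            else:
                Q[site(m), site(m + 1)] += k    # forward step
            if m >= 1:
                Q[site(m), site(m - 1)] += r    # one step backward
                Q[site(m), 0] += g              # proofreading reset to the origin
    Q -= np.diag(Q.sum(axis=1) + absorb.sum(axis=0))
    return Q, absorb

def completion_stats(Q, absorb):
    """Return (P_correct, P_wrong, mean arbitrary completion time)."""
    e0 = np.zeros(Q.shape[0])
    e0[0] = 1.0                                 # the process starts at the origin
    occupancy = np.linalg.solve(-Q.T, e0)       # expected time spent in each state
    p_correct, p_wrong = absorb @ occupancy
    return p_correct, p_wrong, occupancy.sum()

# Example: proofreading only (r = 0), equal forward rates, L1 = L2 = 16
Q, absorb = kpr_generator(16, 16, 1.0, 1.0, g1=0.4, g2=0.5)
p_c, p_w, t_mean = completion_stats(Q, absorb)
```

The linear solve uses the fact that the expected time spent in each transient state before absorption is given by the first row of $(-Q)^{-1}$; multiplying these occupation times by the exit rates gives the two completion probabilities.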

For any given specific case, this CME may be solved using various methods, such as 
projection approaches [23, 24, 25, 26, 27], or simulated using stochastic simulations [28, 29, 30]. 
Similarly, completion times for a given process could be calculated directly from the CME using 
projection approaches [31] or analyzed using transition path and transition interface sampling 
[32, 33, 34, 35, 36]. However, in this work we take an analytical approach in an effort to attain 
explicit expressions for the temporal behavior of the process in terms of the kinetic parameters. 
Later, in Section III, those explicit expressions will better enable us to study the dependence of 
the specificity and completion time distributions on the system's parameters, such as the number of 
intermediate steps and the forward, backward and proofreading rates. More specifically, we first simplify 
the set of differential equations describing the dynamics of the occupation probabilities by applying 
the Laplace transform: 

\[
P_{i,j}(s) = \int_0^{\infty} p_{i,j}(t)\, e^{-st}\, dt, \tag{2}
\]



where we are using lowercase variables to represent quantities in the time domain and uppercase 
variables to represent the corresponding quantities in the Laplace domain. Upon application of the 
Laplace transform, the probabilities are now described by the following algebraic master equation 



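\[
P_{i,j}(s) =
\begin{cases}
\dfrac{k_2}{s}\, P_{0,L_2-1}(s) & \text{for } (i,j) = (0, L_2) \\[2mm]
\dfrac{k_2}{s+k_2+\gamma_2+r_2}\, P_{0,L_2-2}(s) & \text{for } (i,j) = (0, L_2-1) \\[2mm]
\dfrac{1}{s+k_2+\gamma_2+r_2} \left( k_2\, P_{0,j-1}(s) + r_2\, P_{0,j+1}(s) \right) & \text{for } i = 0 \text{ and } 0 < j < L_2-1 \\[2mm]
\dfrac{1}{s+k_1+k_2} \left( 1 + r_1 P_{1,0}(s) + r_2 P_{0,1}(s) + \gamma_1 \sum_{i=1}^{L_1-1} P_{i,0}(s) + \gamma_2 \sum_{j=1}^{L_2-1} P_{0,j}(s) \right) & \text{for } (i,j) = (0, 0) \\[2mm]
\dfrac{1}{s+k_1+\gamma_1+r_1} \left( k_1\, P_{i-1,0}(s) + r_1\, P_{i+1,0}(s) \right) & \text{for } j = 0 \text{ and } 0 < i < L_1-1 \\[2mm]
\dfrac{k_1}{s+k_1+\gamma_1+r_1}\, P_{L_1-2,0}(s) & \text{for } (i,j) = (L_1-1, 0) \\[2mm]
\dfrac{k_1}{s}\, P_{L_1-1,0}(s) & \text{for } (i,j) = (L_1, 0).
\end{cases}
\tag{3}
\]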

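For the above equation we have already imposed the initial condition $p_{i,j}(t = 0) = \delta_{i,0}\delta_{j,0}$, 
where $\delta$ is the Kronecker delta. In other words, $p_{0,0}(0) = 1$ and $p_{i,j}(0) = 0$ for all $(i, j) \neq (0, 0)$. 
The general solution of these equations is explicitly written as 

\[
P_{i,j}(s) =
\begin{cases}
A\lambda_1^{i} + B\lambda_2^{i} & \text{for } j = 0,\ i \geq 0 \\
A\beta_2^{j} + B\beta_2^{j} + C\left(\beta_1^{j} - \beta_2^{j}\right) & \text{for } i = 0,\ j > 0.
\end{cases}
\tag{4}
\]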



The space-independent parameters $\lambda_{1,2}(s)$ and $\beta_{1,2}(s)$ are obtained from the solutions of the 
quadratic equations 

\[
\frac{k_1}{s+k_1+\gamma_1+r_1} + \frac{r_1}{s+k_1+\gamma_1+r_1}\,\lambda^2 - \lambda = 0,
\qquad
\frac{k_2}{s+k_2+\gamma_2+r_2} + \frac{r_2}{s+k_2+\gamma_2+r_2}\,\beta^2 - \beta = 0, \tag{5}
\]

which come from the expressions for $P_{i,j}(s)$ at the interior points of the two branches. The bound-
ary conditions are satisfied by proper choice of the coefficients $A(s)$, $B(s)$ and $C(s)$. The boundary 
condition at $(i, j) = (0, 0)$ (see Eq. (3)) is expressed as: 

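\[
(s + k_1 + k_2)(A + B) = 1 + r_1\left(A\lambda_1 + B\lambda_2\right) + r_2\left((A + B)\beta_2 + C(\beta_1 - \beta_2)\right)
+ \gamma_1 \sum_{i=1}^{L_1-1} \left(A\lambda_1^{i} + B\lambda_2^{i}\right) + \gamma_2 \sum_{j=1}^{L_2-1} \left((A + B)\beta_2^{j} + C\left(\beta_1^{j} - \beta_2^{j}\right)\right). \tag{6}
\]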

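The boundary condition at $(i, j) = (L_1 - 1, 0)$ is written as: 

\[
A\lambda_1^{L_1-1} + B\lambda_2^{L_1-1} = \frac{k_1}{s+k_1+\gamma_1+r_1}\left(A\lambda_1^{L_1-2} + B\lambda_2^{L_1-2}\right), \tag{7}
\]

and the boundary condition at $(0, L_2 - 1)$ is 

\[
C\beta_1^{L_2-1} + (A + B - C)\beta_2^{L_2-1} = \frac{k_2}{s+k_2+\gamma_2+r_2}\left(C\beta_1^{L_2-2} + (A + B - C)\beta_2^{L_2-2}\right). \tag{8}
\]

Using the definitions of $\lambda_{1,2}$ (see Eq. (5)) we can rewrite Eq. (7) as 

\[
B = -A\,\frac{\lambda_1^{L_1}}{\lambda_2^{L_1}}. \tag{9}
\]

Similarly, using the definitions of $\beta_{1,2}$ we rewrite Eq. (8) as 

\[
C = A\,\frac{1 - \lambda_1^{L_1}/\lambda_2^{L_1}}{1 - \beta_1^{L_2}/\beta_2^{L_2}}. \tag{10}
\]

Finally, using Eqs. (9, 10) one can simplify Eq. (6) to 

\[
\frac{1}{A} = (s + k_1 + k_2)\left(1 - \frac{\lambda_1^{L_1}}{\lambda_2^{L_1}}\right)
- r_1 \lambda_1 \left(1 - \frac{\lambda_1^{L_1-1}}{\lambda_2^{L_1-1}}\right)
- \gamma_1 \sum_{i=1}^{L_1-1} \lambda_1^{i} \left(1 - \frac{\lambda_1^{L_1-i}}{\lambda_2^{L_1-i}}\right)
- \frac{1 - \lambda_1^{L_1}/\lambda_2^{L_1}}{1 - \beta_1^{L_2}/\beta_2^{L_2}}
\left[ r_2 \beta_1 \left(1 - \frac{\beta_1^{L_2-1}}{\beta_2^{L_2-1}}\right) + \gamma_2 \sum_{j=1}^{L_2-1} \beta_1^{j} \left(1 - \frac{\beta_1^{L_2-j}}{\beta_2^{L_2-j}}\right) \right]. \tag{11}
\]

Note that in deriving Eqs. (9, 10, 11) we assumed that the parameters $k_1$, $k_2$, $r_1$, $r_2$, $\gamma_1$, $\gamma_2$ are 
nonzero. 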

In order to study the temporal behavior of the kinetic proofreading model, we compute (i) 
the probability that the system will reach the correct terminus point and (ii) the distribution of 
time until the system reaches one of the two possible terminus points. Both of these quantities 
are found by examining the probability density function (PDF) for the first passage time to the 
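absorbing sites $(L_1, 0)$ or $(0, L_2)$, which are given by: 

\[
f_1(t) = k_1\, p_{L_1-1,0}(t), \qquad f_2(t) = k_2\, p_{0,L_2-1}(t). \tag{12}
\]

According to Eqs. (12) and (4), the Laplace transforms of the first passage time PDFs are given by 

\[
F_1(s) = k_1 \left( A\lambda_1^{L_1-1} + B\lambda_2^{L_1-1} \right), \qquad
F_2(s) = k_2 \left( C\beta_1^{L_2-1} + (A + B - C)\beta_2^{L_2-1} \right). \tag{13}
\]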



This expression now contains a wealth of information about the moments of the escape time 
distributions. For example, the probability of reaching the correct absorbing site, $(i, j) = (L_1, 0)$, 
is found by evaluating $F_1(s)$ at $s = 0$. Furthermore, the $m$-th moment of the arbitrary completion 
time is 

\[
T^{(m)} = \int_0^{\infty} t^m \left( f_1(t) + f_2(t) \right) dt
= (-1)^m \left[ \frac{d^m F_1(s)}{ds^m} + \frac{d^m F_2(s)}{ds^m} \right]_{s=0}, \tag{14}
\]

and the $m$-th normalized moment of the escape time to the correct site $(i, j) = (L_1, 0)$ is: 

\[
T_1^{(m)} = \frac{(-1)^m}{F_1(0)} \left[ \frac{d^m F_1(s)}{ds^m} \right]_{s=0}. \tag{15}
\]
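The moment relations above can also be evaluated numerically without symbolic differentiation. Writing the CME as $dp/dt = Q^{T} p$ with $p(0) = e_0$, the transform of any linear read-out $f(t) = a \cdot p(t)$ is $F(s) = a \cdot (sI - Q^{T})^{-1} e_0$, so that $(-1)^m d^m F/ds^m|_{s=0} = m!\, a \cdot (-Q^{T})^{-(m+1)} e_0$. The following Python sketch applies this to the smallest nontrivial chain; the helper name `laplace_moments` and the chosen rates are assumptions made for illustration only.

```python
import math
import numpy as np

def laplace_moments(Q, a, order):
    """Return [(-1)^m d^m F/ds^m at s = 0 for m = 0..order],
    where F(s) = a . (s I - Q^T)^{-1} e0 and Q[x, y] is the rate x -> y."""
    v = np.zeros(Q.shape[0])
    v[0] = 1.0                                  # p(0): all probability at the origin
    moments = []
    for m in range(order + 1):
        v = np.linalg.solve(-Q.T, v)            # v = (-Q^T)^{-(m+1)} e0
        moments.append(math.factorial(m) * float(a @ v))
    return moments

# Smallest nontrivial chain: L1 = 2, L2 = 1, with k1 = k2 = gamma1 = 1 and r1 = 0.
k1, k2, gamma1 = 1.0, 1.0, 1.0
Q = np.array([[-(k1 + k2), k1],
              [gamma1, -(k1 + gamma1)]])        # transient states: origin, site (1,0)
a1 = np.array([0.0, k1])                        # f1(t) = k1 * p_{1,0}(t)
a2 = np.array([k2, 0.0])                        # f2(t) = k2 * p_{0,0}(t)

F1_0, minus_dF1 = laplace_moments(Q, a1, 1)
F2_0, minus_dF2 = laplace_moments(Q, a2, 1)
p_correct = F1_0                                # F1(0)
mean_any = minus_dF1 + minus_dF2                # first moment of the arbitrary completion time
mean_correct = minus_dF1 / F1_0                 # normalized first moment for the correct site
```

For this two-state example the transforms can be written by hand, $F_1(s) = k_1^2 / [(s + k_1 + k_2)(s + k_1 + \gamma_1) - k_1\gamma_1]$, which gives a direct check on the numerical values.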



III. RESULTS AND DISCUSSION 



The non-normalized Laplace transforms of the two branches, $F_1(s)$ and $F_2(s)$, provide a com-
plete description of the completion process. In particular, we analyze two important quantities: 
(1) the probability that the process completes via one branch or the other and (2) the distribution 
of time needed for this completion. In the latter case, we concentrate our attention on the mean 
and variance of the completion times. For the general two-branch process, it is relatively simple to 
generate symbolic expressions for the completion probabilities and the moments of the completion 
times. Where these expressions are simple enough to be informative, we will provide their explicit 
forms, for which we will use the following notation 

\[
l_{1,2} = \lambda_{1,2}|_{s=0}; \qquad b_{1,2} = \beta_{1,2}|_{s=0}; \qquad \text{and} \qquad A_0 = A|_{s=0}. \tag{16}
\]

Where the expressions are not sufficiently compact, particularly for the higher moments of the 
completion time distributions, we will use numerical examples to illustrate their dependence on 
parameters. For these numerical examples, we fix the length of each branch to involve $L_1 = L_2 = 
16$ steps. To explore the effect of different time scales in each branch, we will consider the case 
when the forward rates of both branches are equal ($k_1 = k_2$) and the case where the forward rate 
of the "correct" branch is six times that of the "wrong" branch ($k_1 = 6k_2$). 



A. "Correct" and "Wrong" Completion Probabilities 

In a kinetic proofreading process, the biochemical process must somehow give preference to 
completing in the correct way, i.e., adding the correct amino acid to the growing protein chain or 
initiating intracellular signaling when the correct ligand is bound to the receptor, but not when 
the incorrect ligand is bound. In our simplified model, this preference corresponds to reaching 
one absorbing site rather than the other. Here we analyze how changes in the relevant parameters 
affect this preference. Following the derivations in the previous section we can write the "correct" 
or "wrong" completion probabilities ($P_C$ and $P_W$, respectively) as 

\[
P_C = F_1(0) = k_1\, l_1^{L_1-1} \left(1 - l_1/l_2\right) A_0, \qquad P_W = F_2(0) = 1 - P_C. \tag{17}
\]

For example, one can use these expressions to derive expressions for the directional completion 
probabilities for the directed kinetic proofreading (dKPR) scheme ($\gamma_{1,2} > 0$ and $r_{1,2} = 0$), which 
are 

\[
P_{C\text{-dKPR}} = \frac{(k_1/k_2)(1 + \psi_2)^{L_2-1}}{(1 + \psi_1)^{L_1-1} + (k_1/k_2)(1 + \psi_2)^{L_2-1}}, \qquad
P_{W\text{-dKPR}} = \frac{(1 + \psi_1)^{L_1-1}}{(1 + \psi_1)^{L_1-1} + (k_1/k_2)(1 + \psi_2)^{L_2-1}}, \tag{18}
\]

where we have used the notation $\psi_{1,2} = \gamma_{1,2}/k_{1,2}$. 

Fig. 2A shows the probability of completing in the first direction as a function of the kinetic 
proofreading rates $\psi_{1,2}$ in the case of equal forward rates ($k_1 = k_2 = 1$). From the figure, it is 
apparent that a large amount of specificity is achievable for a properly chosen combination of $\psi_1$ 
and $\psi_2$. For example, the system will complete in the correct direction more than 99.99% 
of the time for any $(\psi_1, \psi_2)$ combination in the lower right corner. Similarly, one can compute 
the directional probabilities in the case of the absorption mode (AM) process (see Fig. 2B), where 
$\gamma_{1,2} = 0$ but the backward rates $r_{1,2}$ are allowed to vary. In this case the contour lines for the 
completion probabilities are less trivial than for the dKPR case. In particular, the contour lines 
exhibit a bottleneck near the values of $\theta_{1,2} = r_{1,2}/k_{1,2} = 1$, where the specificity can change 
dramatically despite relatively small changes in the parameter values. 

The objective of kinetic proofreading is to provide large amplification in directional specificity 
despite small changes in the parameters $\psi$ or $\theta$. To compare how well the dKPR and AM processes 
achieve this objective, we have drawn red dashed lines in each plot corresponding to $\psi_1 = 0.8\psi_2$ 
or $\theta_1 = 0.8\theta_2$, i.e., a twenty percent difference in the relative proofreading or backward 
ratios, respectively, between the two branches. Since $k_1 = k_2$, this is equivalent to a 
20 percent difference in the actual rates $\gamma$ and $r$. As the backward and proofreading rates increase, 
the specificity also increases for both processes, as can be seen by how the dashed lines cross the 
contour levels. The first observation to note is that both the dKPR and the AM process can attain 
90% specificity with a twenty percent difference in rates (see stars in Figs. 2A-B) at values of the 
parameters which are within the range of the plots. 
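As a rough illustration of the twenty percent comparison for the dKPR scheme, the closed-form completion probability (with $r_{1,2} = 0$) can be evaluated along the line $\psi_1 = 0.8\psi_2$ and inverted by bisection for a target specificity. The sketch below is only an example under these assumptions; the function names, the bracketing interval and the tolerance are hypothetical choices and not values taken from the text.

```python
def p_correct_dkpr(psi1, psi2, k1=1.0, k2=1.0, L1=16, L2=16):
    """Closed-form dKPR probability of correct completion (r1 = r2 = 0)."""
    gain = (k1 / k2) * (1.0 + psi2) ** (L2 - 1)
    return gain / ((1.0 + psi1) ** (L1 - 1) + gain)

def psi2_for_target(target, ratio=0.8, lo=0.0, hi=50.0, tol=1e-12):
    """Bisect for psi2 on the line psi1 = ratio * psi2 (hypothetical helper)."""
    if not p_correct_dkpr(ratio * lo, lo) < target < p_correct_dkpr(ratio * hi, hi):
        raise ValueError("target specificity is not bracketed on this line")
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if p_correct_dkpr(ratio * mid, mid) < target:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

# psi1 = 0.8 * psi2 with k1 = k2 and L1 = L2 = 16
psi2_90 = psi2_for_target(0.90)
```

With equal forward rates the ratio $(1+\psi_2)/(1+0.8\psi_2)$ is bounded by 1.25, so along this line the dKPR specificity saturates at $1.25^{15}/(1 + 1.25^{15}) \approx 0.966$; a request for 99% specificity therefore raises the `ValueError` in this sketch.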

Figure 2: Proofreading with Equal Forward Rates, $k_1 = k_2 = 1$. Contour plots of the probability of correct 
completion (A,B) and the corresponding mean decision time (C,D) for two different decision processes. 
(A,C) For the dKPR process with varying kinetic proofreading rates $\psi_1 = \gamma_1/k_1$ and $\psi_2 = \gamma_2/k_2$ and zero 
backward rates, $r_{1,2} = 0$. (B,D) For the AM process with varying backward rates $\theta_1 = r_1/k_1$ and $\theta_2 = 
r_2/k_2$ and zero proofreading rates, $\gamma_{1,2} = 0$. For both plots, the lengths of the branches are $L_1 = L_2 = 16$, 
and the contour lines denote the probabilities of correct completion (upper panels) or mean completion 
time in units of $1/k_2$ (lower panels). The red dashed line corresponds to a twenty percent difference in the 
proofreading or backward ratios, $\psi_1 = 0.8\psi_2$ or $\theta_1 = 0.8\theta_2$, respectively. 

Figs. 3A-B show the completion probabilities for a case where the forward rates differ 
from one branch to the next. While many qualitative trends of this case are similar to the previous 
case with equal forward rates, the analysis becomes a little more complicated. First, the difference in 
forward rates already provides a certain amount of correction ($k_1/(k_1 + k_2) = 6/7$) before any 
additional effects of proofreading or backward rates. In turn, the proofreading and backward rates 
can amplify this specificity much more than in the previous case for similar relative changes in 
parameters from one branch to the next. In this case, because the two branches have different 
forward rates, one can consider small relative changes in the ratios ($\psi$ or $\theta$, red dashed lines) or 
in the absolute rates ($\gamma$ or $r$, blue dashed lines). In the former case, with a twenty percent change 
in the ratios ($\psi_1 = 0.8\psi_2$ or $\theta_1 = 0.8\theta_2$), either process can attain a 90% specificity (white stars) 
but only the AM process is capable of providing 99% specificity (pink star) within the parameter 
range shown in the figure. In the latter case, when the actual rates $\gamma$ or $r$ are only slightly varied 
from one branch to the other (blue dashed lines), far greater specificity is achievable with 
either model. Indeed, a high level of specificity is achievable in either process even when these 
rates are identical so long as the forward rates are different (not shown). 
Figure 3: Proofreading with Different Forward Rates, $k_1 = 6$ and $k_2 = 1$. Contour plots of the probability 
of correct completion (A,B) and the corresponding mean decision time (C,D) for two different decision 
processes. (A,C) For the dKPR process with varying kinetic proofreading rates $\psi_1 = \gamma_1/k_1$ and $\psi_2 = \gamma_2/k_2$ 
and zero backward rates, $r_{1,2} = 0$. (B,D) For the AM process with varying backward rates $\theta_1 = r_1/k_1$ and 
$\theta_2 = r_2/k_2$ and zero proofreading rates, $\gamma_{1,2} = 0$. For both systems, we have set the forward rates to $k_1 = 6$, 
$k_2 = 1$, and the lengths to $L_1 = L_2 = 16$. The contour lines denote the probabilities of correct completion 
(upper panels) and mean completion time in units of $1/k_2$ (lower panels). The red dashed line corresponds to 
a twenty percent difference in the proofreading or backward ratios, $\psi_1 = 0.8\psi_2$ or $\theta_1 = 0.8\theta_2$, respectively. 
The blue dashed line corresponds to a twenty percent difference in the proofreading or backward rates, 
$\gamma_1 = 0.8\gamma_2$ or $r_1 = 0.8 r_2$, respectively. 

B. Average Completion Times 

From the perspective of kinetic proofreading, in addition to forming the correct product, a process 
must complete this construction in a timely manner. For example, the AM and dKPR schemes 
may achieve the same amplification of specificity, but one may be able to do so faster than the other. 
While a detailed analysis of this tradeoff between specificity and efficiency is left for future work, 
we begin to explore this aspect of the system by examining the mean completion time. Although 
the expressions for the mean completion times are trivial to generate, they are cumbersome to write 
in the general case. Therefore, in the interest of brevity, we provide explicit expressions only for 
the case of directed kinetic proofreading, for which the mean "correct" completion time is given 
by 



\[
T_{C\text{-dKPR}} = T_{\text{dKPR}} + P_{W\text{-dKPR}} \left[ \frac{L_1 - 1}{k_1 (1 + \psi_1)} - \frac{L_2 - 1}{k_2 (1 + \psi_2)} \right]. \tag{19}
\]

Similarly, we find the mean "wrong" completion time 

\[
T_{W\text{-dKPR}} = T_{\text{dKPR}} - P_{C\text{-dKPR}} \left[ \frac{L_1 - 1}{k_1 (1 + \psi_1)} - \frac{L_2 - 1}{k_2 (1 + \psi_2)} \right]. \tag{20}
\]

Here $P_{C\text{-dKPR}}$ and $P_{W\text{-dKPR}}$ are the dKPR completion probabilities of Eq. (18). 
The average arbitrary completion time (without specifying correct or wrong completion), $T_{\text{dKPR}}$, is 

\[
T_{\text{dKPR}} = \frac{ (1 + \psi_1)^{L_1} (1 + \psi_2)^{L_2} \left( 1 + \dfrac{1}{\psi_1} + \dfrac{1}{\psi_2} \right)
- \dfrac{1}{\psi_1} (1 + \psi_1)(1 + \psi_2)^{L_2}
- \dfrac{1}{\psi_2} (1 + \psi_1)^{L_1} (1 + \psi_2) }
{ k_2 \left[ (1 + \psi_1)^{L_1} (1 + \psi_2) + (k_1/k_2)(1 + \psi_1)(1 + \psi_2)^{L_2} \right] }. \tag{21}
\]


Figs. 2C-D show contour plots for the average completion times of the dKPR and AM processes 
for ranges comparable to the specificity plots in Figs. 2A-B and $k_{1,2} = 1$. From these plots, we 
can observe that as the backward or proofreading rates increase, the amount of time required to 
complete the process increases exponentially. As before in Figs. 2A-B, the dashed lines denote the 
lines where $\psi_1 = 0.8\psi_2$ or $\theta_1 = 0.8\theta_2$ and the stars represent the crossings of the 90% specificity contour. 
While we saw in Figs. 2A-B that both processes were able to provide 90% specificity (for a 20% 
difference in the backward/proofreading rates), the AM process can provide it with a much smaller 
mean completion time. Similarly, Figs. 3C-D show contour plots of the mean completion times 
of the dKPR and AM processes with $k_1 = 6$ and $k_2 = 1$. The white/pink/black stars denote the 
90%, 99% and 99.9% specificities, respectively. The red dashed lines correspond to $\theta_1 = 0.8\theta_2$ 
(or $\psi_1 = 0.8\psi_2$) and the blue dashed lines correspond to $r_1 = 0.8 r_2$ (or $\gamma_1 = 0.8\gamma_2$). We can see 
again that for a 20% difference in the backward/proofreading rates (blue dashed lines) or their ratios to 
the corresponding forward rates (red dashed lines) the AM process can provide the requested 
specificity for much smaller average completion times. 
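The comparison between the two schemes at equal specificity can be reproduced numerically for the equal-rate case ($k_1 = k_2 = 1$, $L_1 = L_2 = 16$). The Python sketch below tunes each scheme to 90% specificity along the twenty percent line and then reads off the mean completion time from a linear solve of the transient chain. The helper names, the bisection bracket and the iteration count are assumptions of this example, not part of the original analysis.

```python
import numpy as np

def solve_chain(L, k1, k2, r1, r2, g1, g2):
    """Return (P_correct, mean completion time) for two branches of equal length L."""
    n = 2 * L - 1                               # origin + (L-1) interior sites per branch
    Q = np.zeros((n, n))                        # Q[a, b] = rate a -> b (transient states)
    to_correct = np.zeros(n)
    exit_rate = np.zeros(n)                     # total absorption rate out of each state
    for k, r, g, offset, correct in ((k1, r1, g1, 0, True), (k2, r2, g2, L - 1, False)):
        idx = [0] + [offset + m for m in range(1, L)]
        for m in range(L):
            if m == L - 1:
                exit_rate[idx[m]] += k          # last forward step is absorbing
                if correct:
                    to_correct[idx[m]] += k
            else:
                Q[idx[m], idx[m + 1]] += k      # forward step
            if m >= 1:
                Q[idx[m], idx[m - 1]] += r      # backward step
                Q[idx[m], 0] += g               # proofreading reset
    Q -= np.diag(Q.sum(axis=1) + exit_rate)
    e0 = np.zeros(n)
    e0[0] = 1.0
    occupancy = np.linalg.solve(-Q.T, e0)       # expected time spent in each state
    return float(to_correct @ occupancy), float(occupancy.sum())

def tune_to_specificity(scheme, target=0.90, ratio=0.8, L=16, hi=5.0):
    """Bisect on x = psi2 (dKPR) or theta2 (AM); the correct branch uses ratio * x."""
    def stats(x):
        if scheme == "dKPR":
            return solve_chain(L, 1.0, 1.0, 0.0, 0.0, ratio * x, x)
        return solve_chain(L, 1.0, 1.0, ratio * x, x, 0.0, 0.0)
    lo = 0.0
    for _ in range(60):
        mid = 0.5 * (lo + hi)
        if stats(mid)[0] < target:
            lo = mid
        else:
            hi = mid
    return mid, stats(mid)

x_dkpr, (p_dkpr, t_dkpr) = tune_to_specificity("dKPR")
x_am, (p_am, t_am) = tune_to_specificity("AM")
```

In this sketch both schemes reach the same specificity, and the two mean times `t_dkpr` and `t_am` (in units of $1/k_2$) can then be compared directly.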

To better understand the behavior of the mean completion time, we illustrate in Fig. 4 the ef-
fects that changes in the parameters $\psi_{1,2}$ have on these mean completion times for the process in 
which the forward rate on the correct branch is six times the rate on the wrong branch, $k_1 = 6k_2$. 
At first glance at Fig. 4A or Fig. 3C, it appears that the behavior of the mean arbitrary comple-
tion time is somewhat trivial: as one increases the proofreading rates in both branches, the mean 
waiting time also increases. However, by zooming in along certain strips of this plot, one finds 
additional dependencies of the mean waiting times on the parameters. Suppose that one fixes $\psi_1$ 
to some non-zero value and then changes $\psi_2$ (see top edge of Fig. 4). When $\psi_2$ is zero, the sec-
ond branch is biased forward and the process will complete soon after it enters that 
branch. Conversely, when $\psi_2$ is very large, the process will spend very little time in the second 
branch and the process reduces to the single-branch process, as if the second branch were 
not there. However, when $\psi_2$ is in some middle range, the process will spend significant amounts 
of time in each of the two branches, thereby increasing the total time until completion. Similar 
observations can be made for the AM process (not shown), as should be expected from the non-
trivial shape of the contours of Fig. 3D. 



C. Variance in Completion Times 

Figure 4: Plots of the mean arbitrary completion times (units of $1/k_2$) for the directed kinetic proofreading 
process, with two branches of lengths $L_1 = L_2 = 16$, forward rates $k_1 = 6$ and $k_2 = 1$. Panels B,C show 
a zoomed in perspective of the mean completion times corresponding to the parameter regions indicated in 
panel A. 

In addition to specificity and the average time to arrive at that specificity, a completion pro-
cess is further characterized by the shape of the distribution of its completion time. For some 
parameters this process will have little variance, and the decision is made in a seemingly fixed, 
deterministic amount of time. For other parameters this decision time may be much more broadly 
distributed (the same behavior was found for single-branch processes, see [37]). The relative 
broadness of this shape can be described by the squared coefficient of variation (variance divided 
by the mean squared, $CV^2 = \sigma^2/\mu^2$) of the completion time distribution. The second moments, 
and therefore the variances, can be derived according to the general relation of Eqs. (14, 15), but 
the resulting expressions are too long to provide much valuable insight even in the case of directed 
kinetic proofreading. Instead, we rely on parametric studies to explore how parameters affect the 
completion time distribution shapes. 
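One way to carry out such a parametric study is to compute the squared coefficient of variation directly from the first two moments of the arbitrary completion time, obtained by repeated linear solves of the transient chain. The Python sketch below does this for the case of zero proofreading rates; the helper names `am_chain` and `cv_squared` and the two parameter points are assumptions chosen for illustration and are not the points used in the figures.

```python
import numpy as np

def am_chain(L, k1, k2, theta1, theta2):
    """Transient generator and total exit rates with zero proofreading rates."""
    n = 2 * L - 1
    Q = np.zeros((n, n))                        # Q[a, b] = rate a -> b
    exit_rate = np.zeros(n)
    for k, r, offset in ((k1, theta1 * k1, 0), (k2, theta2 * k2, L - 1)):
        idx = [0] + [offset + m for m in range(1, L)]
        for m in range(L):
            if m == L - 1:
                exit_rate[idx[m]] += k          # last forward step is absorbing
            else:
                Q[idx[m], idx[m + 1]] += k      # forward step
            if m >= 1:
                Q[idx[m], idx[m - 1]] += r      # backward step
    Q -= np.diag(Q.sum(axis=1) + exit_rate)
    return Q, exit_rate

def cv_squared(Q, exit_rate):
    """Squared coefficient of variation of the arbitrary completion time."""
    v = np.zeros(Q.shape[0])
    v[0] = 1.0                                  # start at the origin
    raw = []
    for _ in range(3):
        v = np.linalg.solve(-Q.T, v)            # successive powers of (-Q^T)^{-1}
        raw.append(float(exit_rate @ v))
    mean = raw[1]                               # first moment (raw[0] is the total probability)
    second = 2.0 * raw[2]                       # second moment
    return (second - mean ** 2) / mean ** 2

cv2_forward = cv_squared(*am_chain(16, 1.0, 1.0, 0.0, 0.0))   # both branches biased forward
cv2_backward = cv_squared(*am_chain(16, 1.0, 1.0, 2.0, 2.0))  # both branches biased backward
```

For the purely forward case with $k_1 = k_2 = 1$ the completion time is a sum of one exponential step of rate 2 and fifteen exponential steps of rate 1, so $CV^2 = 15.25/15.5^2$, which provides a simple check; for a strong backward bias on both branches the value approaches one.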

In what follows we consider the same cases as above and classify the shapes of the resulting 
completion time distributions. First, we consider the case of zero proofreading rates, $\gamma_{1,2} = 0$. 
Fig. 5 shows a contour plot of the coefficient of variation of the arbitrary completion time versus 
$\theta_1 = r_1/k_1$ and $\theta_2 = r_2/k_2$ and typical completion time distributions for the parameter values 
$k_1 = 6k_2$ and $\{(\theta_1, \theta_2)\} = \{(2, 1), (1.2, 1.2), (0, 0), (0, 0.88)\}$. This plot allows us to divide the 
parameter space into a few regions with different shapes for the completion time distribution. 
The large green area (color online) in the upper right corner corresponds to $0.9 < CV^2 < 1.1$, 
where the completion time distribution is often well approximated by an exponential distribution. 
The corresponding side panel (Fig. 5) shows the "correct" (red) and "wrong" (blue) completion 
time distributions as well as the arbitrary completion time distribution (green). In this case, all 
three distributions are almost exponential, with the small exception of their left tails. The red areas 
(color online), where the coefficient of variation is $0 < CV^2 < 0.2$, correspond to cases where one 
branch is strongly biased backwards while the other is biased forward. For these, the completion 
time along the backward biased branch is nearly exponential, while the completion time along 
the forward biased branch is effectively described by a gamma distribution (see Fig. 5A). Since 
the process is far more likely to finish along the forward biased branch, the total completion time 
distribution is also well approximated by a narrow gamma distribution, as illustrated in Fig. 5A. 
The bottom left panel shows the distributions in the case where both branches are biased forward, 
$\theta_1 = \theta_2 = 0$. In this case, the completion time distribution for each branch is a gamma distribution, 
and the total completion time distribution is a simple combination of the two, since the probability 
to complete at each of the branches is proportional to the forward rate at that branch. As a result, 
the total completion time has a bimodal distribution as shown in Fig. 5. The final area of interest 
(shown in blue online) corresponds to the conditions where the coefficient of variation is greater 
than 1.1, such that the total completion time distribution is broader than exponential, as is shown in 
Fig. 5 for the point of maximal $CV^2$. Because motion in one branch is strongly biased 
forward while motion in the other branch is almost unbiased, we obtain a non-trivial combination 
of the two behaviors in the total completion time distribution. 

We now consider the case where there is proofreading ($\gamma_{1,2} > 0$) but where the backward rates 
are set to zero, $r_{1,2} = 0$. Fig. 6 shows a contour plot of the coefficient of variation of the arbitrary 
completion time versus $\psi_1 = \gamma_1/k_1$ and $\psi_2 = \gamma_2/k_2$ and typical completion time distributions 
for the parameter values $k_1 = 6k_2$ and $\{(\psi_1, \psi_2)\} = \{(0.4, 0), (0.3, 0.3), (0, 0), (0.05, 0.1)\}$. As 
above in Fig. 5, we can divide the parameter space into a few regions with different shapes for 
the completion time distribution. For example, the large green area (color online) corresponds 
to a coefficient of variation near one, where the directional and arbitrary completion time 
distributions are well approximated by exponential distributions (see Fig. 6). Similarly, for the 
small red areas where one branch is biased backwards and the other forward, the completion 
time along the backward biased branch is nearly exponential, while the completion time along the 
forward biased branch is effectively described by a gamma distribution (see Fig. 6A). 

We now turn to the more general case where there are both proofreading and backward reac-
tions ($\gamma_{1,2} > 0$, $r_{1,2} > 0$). For this case, Fig. 7 shows a 3D plot of the coefficient of variation of 
the arbitrary completion time versus $\theta_{1,2}$ (upper row) or $\psi_{1,2}$ (lower row). These figures emphasize the 
different effects of changes in $\theta$ or $\psi$. While in all cases a strong backward bias on both branches 
(large $\theta_{1,2}$ or $\psi_{1,2}$) leads to an exponential distribution of the completion time, the backward bias has 
a different dependence on the system size and different ranges for $\theta$ and $\psi$. 
Figure 5: Contour plot of the coefficient of variation (of the arbitrary completion time) versus $r_1/k_1$ and 
$r_2/k_2$ and typical completion time distributions. We used the case of zero proofreading rates, $\gamma_{1,2} = 0$. We 
also set $k_1 = 6$ and $k_2 = 1$. The different colors correspond to different behavior of the completion time 
distributions (see text for more details). The side panels (A-D) show the distributions of completion times 
in the correct (red) and incorrect (blue) directions and the arbitrary completion time distribution (green). 
The inset in each of the panels shows a semi-log plot of the distribution to amplify the differences between 
the lines. 

D. Simplification of the Two-Branch Decision Process 



Figure 6: Contour plot of the coefficient of variation (of the arbitrary completion time) versus $\psi_1 = \gamma_1/k_1$
and $\psi_2 = \gamma_2/k_2$ and typical completion time distributions. We used the case of zero backward rates, $r_{1,2} = 0$.
We also set $k_1 = 1$ and $k_2 = 6$. The different colors correspond to different behavior of the completion
time distributions (see text for more details). The side panels show the distributions of completion times in
the correct (blue) and incorrect (red) directions and the arbitrary completion time distribution (green). The
markers correspond to the best fit for a reduced 3-state model approximation to the processes. The inset in
each of the panels shows a semi-log plot of the distribution to amplify the differences between the lines.

In examining the distributions in Figs. 5A-D, one observes that the completion time distribution
of each branch is often similar to a gamma distribution (or an exponential distribution, which is a
special case of the gamma distribution). This suggests that one should frequently be able to replace
the entire process with a simple three state chain, as shown in Fig. 8, with the following properties.
Each direction (1, 2) is assumed to have a non-normalized gamma distributed completion time
with densities $f_1(t, x_1, y_1)$ and $f_2(t, x_2, y_2)$, each carrying the usual gamma prefactor $1/\Gamma(x_i)$ and
integrating to $\alpha$ and $1 - \alpha$, respectively, where $0 < \alpha < 1$ denotes the probability of completion
in the first direction. Thus, the total probability density of completing along either branch at time $t$
is approximated by:

$$f_\tau(t) \approx \hat{f}_\tau(t) = f_1(t, x_1, y_1) + f_2(t, x_2, y_2).$$
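
The reduced three state description can be written down in a few lines. This is a minimal sketch under stated assumptions rather than the authors' implementation: the gamma density is taken in the shape-rate form (with $x_i$ as shape and $y_i$ as rate, which is an assumption about the parametrization), and all function names are hypothetical. It evaluates the two non-normalized branch densities, their sum, and the mean and coefficient of variation of the arbitrary completion time implied by the reduced model.

```python
import math


def gamma_pdf(t, shape, rate):
    """Gamma density in shape-rate form (assumed parametrization)."""
    if t <= 0.0:
        return 0.0
    log_p = (shape * math.log(rate) + (shape - 1.0) * math.log(t)
             - rate * t - math.lgamma(shape))
    return math.exp(log_p)


def branch_densities(t, x1, y1, x2, y2, alpha):
    """Non-normalized densities of completing along branch 1 or 2 at time t."""
    f1 = alpha * gamma_pdf(t, x1, y1)           # integrates to alpha
    f2 = (1.0 - alpha) * gamma_pdf(t, x2, y2)   # integrates to 1 - alpha
    return f1, f2


def total_density(t, x1, y1, x2, y2, alpha):
    """Density of completing along either branch at time t."""
    f1, f2 = branch_densities(t, x1, y1, x2, y2, alpha)
    return f1 + f2


def mixture_mean_cv(x1, y1, x2, y2, alpha):
    """Mean and CV of the arbitrary completion time in the reduced model."""
    m1 = alpha * x1 / y1 + (1.0 - alpha) * x2 / y2
    # Second moment of a gamma(shape x, rate y) variable is x (x + 1) / y**2.
    m2 = (alpha * x1 * (x1 + 1.0) / y1 ** 2
          + (1.0 - alpha) * x2 * (x2 + 1.0) / y2 ** 2)
    return m1, math.sqrt(m2 - m1 ** 2) / m1
```

With both shapes equal to one and equal rates, the sum collapses to a single exponential and the CV is exactly one, which matches the exponential regime discussed above.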

Figure 7: The coefficient of variation versus $\theta_1$ and $\theta_2$ or $\psi_1$ and $\psi_2$. In the upper row we fix the ratio
between the proofreading rate and the forward rate ($\psi_{1,2}$) in both branches and show the effect of changing
the ratios between the backward and forward rates, $\theta_{1,2}$. In the bottom row we fix $\theta_{1,2}$ and show the effect
of changing $\psi_{1,2}$. In all cases, as both branches are strongly backward biased, $\mathrm{CV} \approx 1$ and the completion
time distribution is exponential. Further discussion appears in the text.

In numerical studies, we have attempted to find parameter sets $\Lambda = \{x_1, y_1, x_2, y_2, \alpha\}$ that best
match the direction and time distribution of the full escape process in the one norm sense. In other
words, we have found the $\Lambda$ such that:

$$\Lambda = \underset{\{x_1, y_1, x_2, y_2, \alpha\}}{\arg\min} \int_0^\infty \left| f_\tau(t) - \hat{f}_\tau(t, \Lambda) \right| dt. \qquad (22)$$

In most cases, we find that this approximation and optimization does an excellent job of capturing
the qualitative and quantitative behaviors of the complete process, as is shown in Figs. 8A-D. To
further explore the ability of the reduced model to capture the behavior of the full system, we have
explored the original parameter space $\{\theta_1, \theta_2\}$ in order to find the regions where this approximation
is most valid. From Fig. 9A, we immediately see that the approximation is valid in all four corners
of the contour plot where both $\theta_1$ and $\theta_2$ are either relatively large or relatively small, that is,
where both branches are biased in one direction or another. However, even in the regions where
one or both branches are unbiased ($\theta_1 \approx 1$ or $\theta_2 \approx 1$), we note that the fit is still quite good.
Indeed, for this system, we can always find a parameter set $\{x_1, y_1, x_2, y_2, \alpha\}$ that captures the full
escape time distribution to within an error (defined by the norm in Eq. (22)) of 0.2. To illustrate
the success of this approximation, Fig. 9B shows the actual (solid line) and approximate (dashed line)
distributions for the worst-fit case ($\theta_1 = 1.03$, $\theta_2 = 0.95$). For every other case, we were
able to find a three state model that did an even better job of matching the full system behavior.
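
The one norm fit described above can be sketched as follows. Everything here is an illustrative assumption: the paper does not state which optimizer was used, so a crude coordinate search stands in for it; the gamma densities use the shape-rate form; the error sums the absolute differences of both directional densities (one reading of "joint distributions"); and all function names are hypothetical. The target densities must be supplied on a time grid by the user, since the full two branch solution is not reproduced in this sketch.

```python
import math


def gamma_pdf(t, shape, rate):
    """Gamma density for t > 0; the shape-rate form is an assumption."""
    return math.exp(shape * math.log(rate) + (shape - 1.0) * math.log(t)
                    - rate * t - math.lgamma(shape))


def reduced_model(times, params):
    """Directional densities of the three state model on a time grid."""
    x1, y1, x2, y2, alpha = params
    f1 = [alpha * gamma_pdf(t, x1, y1) for t in times]
    f2 = [(1.0 - alpha) * gamma_pdf(t, x2, y2) for t in times]
    return f1, f2


def l1_error(times, dt, target, params):
    """Midpoint-rule estimate of the one norm distance between joint densities."""
    f1_hat, f2_hat = reduced_model(times, params)
    f1, f2 = target
    return dt * sum(abs(a - b) + abs(c - d)
                    for a, b, c, d in zip(f1, f1_hat, f2, f2_hat))


def fit_reduced_model(times, dt, target, start, n_sweeps=60):
    """Crude coordinate search; a move is kept only if it lowers the error."""
    best = list(start)
    best_err = l1_error(times, dt, target, best)
    step = 0.5
    for _ in range(n_sweeps):
        improved = False
        for i in range(5):
            for sign in (1.0, -1.0):
                trial = list(best)
                if i < 4:
                    # Shapes and rates stay positive under multiplicative moves.
                    trial[i] *= math.exp(sign * step)
                else:
                    # alpha moves additively and is clipped to stay inside (0, 1).
                    trial[i] = min(0.999, max(0.001, trial[i] + sign * 0.2 * step))
                err = l1_error(times, dt, target, trial)
                if err < best_err:
                    best, best_err, improved = trial, err, True
        if not improved:
            step *= 0.5  # refine the step once no single move helps
    return best, best_err
```

A call such as `fit_reduced_model(times, dt, (f1_full, f2_full), (1.0, 1.0, 1.0, 1.0, 0.5))`, with `times` a grid of midpoints `(i + 0.5) * dt`, returns the fitted parameter list and the remaining error. A gradient-free search is used here only because the absolute value makes the objective non-smooth; any general-purpose minimizer could take its place.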
Figure 8: Three state model approximation of the original completion time problem. (Top) Schematic
description of the three state model where the conditional escape time in each direction is given by a gamma
distribution. (A-D) Comparison of the escape time distributions using the full original model and the reduced
three state model. The parameters used here are the same as those in Figs. 5A-D.

As was the case for the AM process ($\gamma_{1,2} = 0$), the dKPR process ($r_{1,2} = 0$) is well captured
by the same three state process defined above. To illustrate this, the colored lines in Figs. 6A-D
correspond to the full system completion time distributions, and the markers correspond to the
approximate three state system.



IV. CONCLUSIONS 



Figure 9: Numerical comparison of the completion time distributions for the approximate 3-state model
and the full two branch process. (A) Contour plots of the approximation error (the norm of the difference
between the actual and the approximate joint distributions; see Eq. (22)) versus the ratios $\theta_{1,2} = r_{1,2}/k_{1,2}$.
(B) Illustration of approximate (dashed line) and actual (solid line) completion time distributions (in units
of $1/k_2$) for the parameter set ($\theta_1 = 1.03$, $\theta_2 = 0.95$), which corresponds to the largest approximation error
of 0.20 (the integral in Eq. (22)).

In this work we have begun the exploration of the temporal properties of kinetic proofreading
schemes. To accomplish this, we have derived analytical expressions for the Laplace transform of
the occupation probabilities, from which we obtained the completion time distributions. This
analysis enables the simple derivation of expressions for the completion time moments.
Some of these expressions, such as the completion probabilities and the mean waiting times for certain
processes, are simple enough to be shown explicitly, while others are just as easily derived but are
omitted since their form is too long and not very informative. To enable a better understanding
of the interplay of specificity and temporal behaviors, we focused on the first two moments of the
completion times as well as on the completion probabilities (which are actually the zeroth moment).
We showed that for most parameter sets, each of the considered proofreading schemes can be
reduced to a three state process with simple distributions for the waiting times between transitions.
The simplified process captures most of the relevant features of kinetic proofreading schemes,
namely, the specificity as well as the magnitude and shape of the completion time distributions.
However, the dependence of the simplified behavior on the full system's kinetic parameters is
different for the various proofreading schemes, suggesting that some important information about
the process is retained despite the simplification.
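
The route from a Laplace transform to the completion probability and the first two moments can be illustrated numerically. The sketch below is a stand-in, not the transforms derived in this work: it uses the Laplace transform of a weighted gamma density in shape-rate form, $\alpha\,(y/(y+s))^{x}$, as an assumed example, and the function names are hypothetical. The zeroth moment is the transform at $s = 0$, and the mean and variance follow from its first and second derivatives there, estimated by central finite differences.

```python
import math


def laplace_gamma_branch(s, x, y, alpha):
    """Laplace transform of alpha times a gamma(shape x, rate y) density."""
    return alpha * (y / (y + s)) ** x


def moments_from_laplace(F, h=1e-4):
    """Completion probability, conditional mean and conditional variance.

    F is the Laplace transform of a non-normalized completion time density.
    Derivatives at s = 0 are estimated with central finite differences.
    """
    p = F(0.0)                                   # zeroth moment
    d1 = (F(h) - F(-h)) / (2.0 * h)              # F'(0)  = -p * <t>
    d2 = (F(h) - 2.0 * F(0.0) + F(-h)) / h ** 2  # F''(0) =  p * <t^2>
    mean = -d1 / p
    second = d2 / p
    return p, mean, second - mean ** 2
```

For the gamma stand-in, `moments_from_laplace(lambda s: laplace_gamma_branch(s, x, y, alpha))` should recover a completion probability of `alpha`, a mean close to `x / y` and a variance close to `x / y**2`, so the coefficient of variation is `1 / math.sqrt(x)`.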

We have explicitly considered different kinetic schemes including the traditional directed ki- 
netic proofreading (dKPR) scheme where catastrophic reactions force the process to restart as well 
as an absorption mode (AM) where single step intermediate reactions can provide the same speci- 
ficity. Surprisingly, we find that in most cases the simpler AM process outperforms the dKPR 
process by providing a higher degree of specificity in a shorter amount of time. It is also worth 
mentioning that the dKPR or general kinetic proofreading processes violate the detailed balance 
conditions and therefore are necessarily non-equilibrium processes. The AM process on the other
hand may satisfy the detailed balance condition and in this case is an equilibrium process. In
this sense, the AM process has the added advantage in that it conserves energy, while the dKPR 
process must be continually driven with externally applied energy. 

High specificity appears in many biological systems and likely results from many different
kinetic schemes, suggesting that one needs as much information as possible to distinguish between
one such mechanism and the next. Therefore, in addition to using the specificity and mean
completion times to compare the different processes, we have also used analyses of the completion
time distributions to classify different kinetic schemes and parameter values into separate regimes
where these distributions take on different qualitative shapes. By providing this additional
information, the temporal analysis and classification tools developed here can more precisely support or
oppose hypotheses of particular kinetic proofreading models for particular biochemical systems.
The next logical step is to apply these tools to identify parameters and infer
kinetic mechanisms from experimental measurements of completion time distributions.